TYCOON: Theoretical Framework and Software Tools for Multimodal Interfaces

Author

  • Jean-Claude Martin
Abstract

We define a modality as a process that analyzes and produces chunks of information. For instance, a speech recognition modality analyzes speech signals and produces the labels of the recognized words. Several multimodal interfaces combining such modalities have already been developed (IMMI'95, CMC'95). To build on them and advance the research and implementation of multimodal interfaces, coherent theoretical and software tools are needed. From the "theoretical" point of view, the development of multimodal interfaces raises several issues (Maybury 91, Dowell 95): content selection ("what to say"), modality allocation ("in which modality to say it"), modality realization ("how to say it in that modality") and modality combination. This paper deals with the "modality combination" issue. A multimodal interface developer has to know how to combine modalities and why a given combination may improve the interaction. Yet existing frameworks for human-computer interfaces do not answer these two questions. Instead, they deal with the relations between the modes (language or action), the channels (audio, visual or haptic), the media (speech, text or gesture) and the styles of interaction (command language, selection in a menu) (Frohlich 91). Other frameworks describe the specificities of each modality regarding information content (Bernsen 95), or the temporal and semantic relations between events detected on several modalities (Nigay & Coutaz 93, Karagiannidis 95). From the "software tools" point of view, existing authoring tools only enable the multimedia developer to combine modalities along temporal and spatial dimensions. A common deficiency of these tools is the lack of support mechanisms for the design and implementation tasks (Väänänen 95). This paper describes our approach, named TYCOON, which is based on the notion of TYpes and goals of COOperatioN between modalities. It covers both the theoretical and the software points of view. It is composed of a theoretical framework for studying multimodal interfaces, a specification language, and a multimodal module that integrates events detected by several modalities.
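As a rough illustration of the definition above (a modality as a process that analyzes input and produces chunks of information) and of a module that integrates events from several modalities, here is a minimal Python sketch. It is not TYCOON's specification language or its actual integration module. The names `Chunk`, `Modality` and `group_events` are hypothetical, and grouping events by a fixed time window is an assumption made only for this example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Chunk:
    """A chunk of information produced by a modality (hypothetical type)."""
    source: str   # name of the modality that produced the chunk
    content: str  # e.g. the label of a recognized word
    time: float   # timestamp in seconds


@dataclass
class Modality:
    """A process that analyzes a signal and produces chunks of information."""
    name: str
    analyze: Callable[[str], List[str]]  # stand-in for a real recognizer

    def process(self, signal: str, time: float) -> List[Chunk]:
        return [Chunk(self.name, label, time) for label in self.analyze(signal)]


def group_events(chunks: List[Chunk], window: float) -> List[List[Chunk]]:
    """Group chunks that follow the previous chunk by at most `window` seconds.

    Assumption: temporal proximity is used here only as a simple stand-in
    for whatever integration criteria a real multimodal module applies.
    """
    groups: List[List[Chunk]] = []
    for chunk in sorted(chunks, key=lambda c: c.time):
        if groups and chunk.time - groups[-1][-1].time <= window:
            groups[-1].append(chunk)
        else:
            groups.append([chunk])
    return groups


# Toy modalities: the "speech" one splits text into word labels,
# the "gesture" one labels a pointed-at object.
speech = Modality("speech", analyze=lambda signal: signal.split())
gesture = Modality("gesture", analyze=lambda signal: [f"point:{signal}"])

events = (
    speech.process("put that there", time=0.0)
    + gesture.process("lamp", time=0.4)
    + gesture.process("table", time=3.0)
)
groups = group_events(events, window=1.0)
```

In this toy run the spoken words and the first pointing gesture fall into one group, while the later gesture forms a group of its own.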

Similar resources

Annotating and Measuring Multimodal Behaviour - Tycoon Metrics in the Anvil Tool

We demonstrate how the Tycoon framework can be put to practice with the Anvil tool in a concrete case study. Tycoon offers a coding scheme and analysis metrics for multimodal communication scenarios. Anvil is a generic, extensible and ergonomically designed annotation tool for videos. In this paper, we describe the Anvil tool, the Tycoon scheme/metrics, and their implementation in Anvil for a v...

MiNT: Multimodal Interaction for Modeling and Model Refactoring

The development of software brings together participants from different backgrounds, such as domain experts, analysts, designers, programmers, managers, technical writers, graphic designers, and users. No single participant can understand or control all aspects of the system under development, and thus, all participants depend on others to accomplish their work. Moreover, any change in the syst...

Multimodal Interfaces: A Survey of Principles, Models and Frameworks

The grand challenge of multimodal interface creation is to build reliable processing systems able to analyze and understand multiple communication means in real-time. This opens a number of associated issues covered by this chapter, such as heterogeneous data types fusion, architectures for real-time processing, dialog management, machine learning for multimodal interaction, modeling languages,...

TYCOON: Six Primitive Types of Cooperation for Observing, Evaluating, and Specifying Cooperations

In this paper, we describe TYCOON, a model that we are developing for observing, evaluating and specifying cooperations between software and human agents. This model is based on a typology made of six primitive types of cooperations: equivalence, specialization, transfer, redundancy, complementarity and concurrency. Each of these types may be involved in several goals of cooperation such as ena...

Software Implications for Multimodal User Interfaces

This paper discusses software considerations for multimodal user interfaces, that is, systems able to support human-to-human modalities of communication (such as gesture and natural language). A definition and a classification of multimodal systems is proposed and the distinction between multimodal and multimedia systems is clarified. Then multiagent models and techniques used in graphical user...

Publication date: 1997